    Machine Learning Models that Remember Too Much

    Machine learning (ML) is becoming a commodity. Numerous ML frameworks and services are available to data holders who are not ML experts but want to train predictive models on their data. It is important that ML models trained on sensitive inputs (e.g., personal images or documents) not leak too much information about the training data. We consider a malicious ML provider who supplies model-training code to the data holder, does not observe the training, but then obtains white- or black-box access to the resulting model. In this setting, we design and implement practical algorithms, some of them very similar to standard ML techniques such as regularization and data augmentation, that "memorize" information about the training dataset in the model while keeping the model as accurate and predictive as a conventionally trained one. We then explain how the adversary can extract the memorized information from the model. We evaluate our techniques on standard ML tasks for image classification (CIFAR10), face recognition (LFW and FaceScrub), and text analysis (20 Newsgroups and IMDB). In all cases, we show how our algorithms create models that have high predictive power yet allow accurate extraction of subsets of their training data.
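
    The abstract notes that the attack algorithms resemble standard regularization. One way such a channel could work, shown below as a minimal, hypothetical PyTorch sketch rather than the authors' actual method, is an extra loss term that rewards correlation between a slice of the model's weights and secret training values, so the parameters themselves end up encoding the data. The model, loss coefficient, and names here are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def correlation_term(weights, secret):
    # Negative absolute Pearson correlation between a flat slice of the
    # weights and the flattened secret values (e.g. training-image pixels).
    w = weights - weights.mean()
    s = secret - secret.mean()
    return -torch.abs((w * s).sum() / (w.norm() * s.norm() + 1e-8))

class TinyClassifier(nn.Module):
    # Illustrative stand-in for whatever model the provider's code trains.
    def __init__(self, in_dim=3072, n_classes=10):
        super().__init__()
        self.fc1 = nn.Linear(in_dim, 256)
        self.fc2 = nn.Linear(256, n_classes)

    def forward(self, x):
        return self.fc2(F.relu(self.fc1(x)))

def malicious_step(model, x, y, secret, optimizer, lam=1.0):
    # Looks like an ordinary training step with a regularizer, but the extra
    # term nudges part of fc1's weights to track the secret values.
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x), y)
    w_slice = model.fc1.weight.flatten()[: secret.numel()]
    loss = loss + lam * correlation_term(w_slice, secret)
    loss.backward()
    optimizer.step()
    return loss.item()
```

    With white-box access, an adversary who knows the encoding could later read off that weight slice and rescale it to approximate the hidden training values.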

    Picking on the family: disrupting android malware triage by forcing misclassification

    Machine learning classification algorithms are widely applied to different malware analysis problems because of their proven ability to learn from examples and perform relatively well with little human input. Use cases include the labelling of malicious samples according to families during triage of suspected malware. However, automated algorithms are vulnerable to attacks: an attacker can carefully manipulate a sample to force the algorithm to produce a particular output. In this paper, we discuss one such attack on Android malware classifiers. We design and implement a prototype tool, called IagoDroid, that takes as input a malware sample and a target family, and modifies the sample so that it is classified as belonging to this family while preserving its original semantics. Our technique relies on a search process that generates variants of the original sample without modifying their semantics. We tested IagoDroid against RevealDroid, a recent, open-source Android malware classifier based on a variety of static features. IagoDroid successfully forces misclassification for 28 of the 29 representative malware families present in the DREBIN dataset. Remarkably, it does so by modifying just a single feature of the original malware. On average, it finds the first evasive sample in the first search iteration, and converges to a 100% evasive population within 4 iterations. Finally, we introduce RevealDroid*, a more robust classifier that implements several techniques proposed in other adversarial learning domains. Our experiments suggest that RevealDroid* can correctly detect up to 99% of the variants generated by IagoDroid.
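
    The search process above is described only at a high level. As a rough, hypothetical illustration of feature-space misclassification (a simplified greedy loop, not IagoDroid's actual search), the sketch below flips one binary static feature at a time against a scikit-learn style family classifier until the predicted family matches the target; the classifier clf, the mutable_idx whitelist of semantics-preserving features, and the binary encoding are all assumptions.

```python
import numpy as np

def force_family(clf, x, target_family, mutable_idx, max_iter=10):
    """Greedy stand-in for the evasion search: flip one binary feature per
    iteration, keeping the flip that most increases the probability of the
    target family. Rebuilding a working APK that actually exhibits the new
    feature vector is out of scope for this sketch."""
    x = np.array(x, copy=True)
    t = list(clf.classes_).index(target_family)
    for _ in range(max_iter):
        if clf.predict([x])[0] == target_family:
            return x                              # target family reached
        best_i, best_p = None, clf.predict_proba([x])[0][t]
        for i in mutable_idx:
            cand = x.copy()
            cand[i] = 1 - cand[i]                 # toggle one static feature
            p = clf.predict_proba([cand])[0][t]
            if p > best_p:
                best_i, best_p = i, p
        if best_i is None:
            return None                           # no single flip helps
        x[best_i] = 1 - x[best_i]
    return x if clf.predict([x])[0] == target_family else None
```

    In this toy form the loop only ever commits the single best flip per iteration, which echoes the paper's observation that changing just one feature can already be enough to move a sample across family boundaries.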

    Bostonia: The Boston University Alumni Magazine. Volume 8

    Founded in 1900, Bostonia magazine is Boston University's main alumni publication, which covers alumni and student life, as well as university activities, events, and programs.

    All You Need Is "Love": Evading Hate Speech Detection

    With the spread of social networks and their unfortunate use for hate speech, automatic detection of the latter has become a pressing problem. In this paper, we reproduce seven state-of-the-art hate speech detection models from prior work, and show that they perform well only when tested on the same type of data they were trained on. Based on these results, we argue that for successful hate speech detection, model architecture is less important than the type of data and labeling criteria. We further show that all proposed detection techniques are brittle against adversaries who can (automatically) insert typos, change word boundaries, or add innocuous words to the original hate speech. A combination of these methods is also effective against Google Perspective, a cutting-edge solution from industry. Our experiments demonstrate that adversarial training does not completely mitigate the attacks, and that using character-level features makes the models systematically more attack-resistant than using word-level features.
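
    The three perturbation classes listed above (typo insertion, word-boundary changes, and padding with innocuous words such as the title's "love") can be mimicked with small text transforms. The helpers below are hypothetical Python sketches of those categories, not the authors' attack code, and the function names and rates are assumptions.

```python
import random

def insert_typos(text, rate=0.2, seed=0):
    # Swap two adjacent characters inside randomly chosen words.
    rng = random.Random(seed)
    words = []
    for w in text.split():
        if len(w) > 3 and rng.random() < rate:
            i = rng.randrange(len(w) - 1)
            w = w[:i] + w[i + 1] + w[i] + w[i + 2:]
        words.append(w)
    return " ".join(words)

def break_word_boundaries(text):
    # Remove whitespace so word-level tokenizers see one unfamiliar token.
    return text.replace(" ", "")

def append_innocuous(text, word="love", n=5):
    # Pad the message with benign words to dilute word-level features.
    return text + " " + " ".join([word] * n)
```

    Character-level models are less affected by these transforms than word-level ones, which is consistent with the abstract's note that character-level features are systematically more attack-resistant.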